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CN " Abstract 

We introduce a generalization of the well known graph (vertex) coloring problem, which 



we call the problem of component coloring of graphs. Given a graph, the problem is to 
color the vertices using the minimum number of colors so that the size of each connected 
component of the subgraph induced by the vertices of the same color does not exceed C . 
We give a linear time algorithm for the problem on proper interval graphs. We extend this 
algorithm to solve two weighted versions of the problem in which vertices have integer 
weights. In the splittable version the weights of vertices can be split into differently 
. colored parts, however, the total weight of a monochromatic component cannot exceed C . 

For this problem on proper interval graphs we give a polynomial time algorithm. In the 



CO 
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version we give a 2-approximation algorithm for the non-splittablc problem on proper 



non- splittable version the vertices cannot be split. Using the algorithm for the splittable 



interval graphs which is NP-hard. We also prove that even the unweighted version of the 
CN1 ■ problem is NP-hard for split graphs. 
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1. Introduction 

The vertex coloring problem is to color the vertices of a graph using the minimum 
number of colors so that no two adjacent vertices are assigned the same color. In this 
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paper, we introduce and study a generalization of the vertex coloring problem. In this 
generalized problem, called the problem of component coloring of graphs, we allow two 
adjacent vertices to be assigned the same color. It is customary to consider two variations: 
unweighted and weighted. In the unweighted version of the problem, given an graph 
G = (V,E), the objective is to color the vertices using the minimum number of colors 
such that the size of any monochromatic component, i.e., the connected component of 
the subgraph induced by the vertices of the same color, does not exceed C. The vertex 
coloring problem is a special case of the unweighted component coloring problem where 
C = 1 , and each monochromatic component consists of a single vertex. 

In the weighted version of the problem, given an graph G = (V, E) and for each v e V 
a rational weight W(v) e (0, 1], the objective is to color the vertices using the minimum 
number of colors such that the total weight of any monochromatic component, does not 
exceed 1. 

Since the vertex coloring problem is NP-hard on general graphs [lj], the unweighted 
(and hence weighted) component coloring problem is also NP-hard on general graphs. 

Our formulation of the component coloring problem is motivated by a problem on 
scheduling transmission requests on light-trails, a hardware solution for bandwidth pro- 
visioning in optical WDM (Wavelength Division Multiplexing) networks [2j. In a path 
network of processors using light-trails, each processor has an optical shutter for each 
wavelength which can be configured to be switched ON/OFF for allowing/blocking the 
light signal pass through it. For each wavelength, by suitably configuring the optical shut- 
ter at each processor, the logical path network can be partitioned into subpath networks 
in which multiple transmissions can happen in parallel, provided the total bandwidth 
requirement of the transmissions assigned to a subpath does not exceed the capacity of a 
wavelength. Such subpaths, in which only the end processors have their optical shutters 
blocked, are called light-trails. A light-trail can serve only the transmissions having both 
source and destination within the light-trail. If a transmission is assigned to a light-trail, 
it uses the complete physical span of the light-trail. Given a set of transmission requests, 
each with a bandwidth requirement, the scheduling problem is to configure the optical 
shutters at the processors so that the minimum number of wavelengths is required by 
the light-trails to serve all transmission requests. 
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A graph G is an interval graph if there exists a family X of intervals in a linearly 
ordered set (like the real line), and there exists a one-to-one correspondence between the 
vertices of G and the intervals in X such that two vertices are adjacent if and only if 
the corresponding intervals intersect. If no interval of X properly contains another, set 
theoretically, then G is called a proper interval graph. 

The light-trail scheduling problem on path networks can be posed as a component 
coloring problem on interval graphs as follows. For each transmission request, create 
a vertex with weight equal to the bandwidth requirement, expressed as a fraction of 
the wavelength capacity. Two vertices are adjacent if the corresponding transmissions 
overlap, i.e., they use at least one common link. Given a solution to the component 
coloring problem, a solution to the light-trail scheduling problem can be constructed as 
follows. For each of the used colors, use a separate wavelength. For each wavelength, 
construct a separate light-trail for each monochromatic component of the corresponding 
color. Note that the light-trails on a wavelength do not intersect with each other. All 
transmission requests corresponding to the vertices of a monochromatic component are 
served by the corresponding light-trail. The physical span of the light-trail is the union 
of the physical spans of all requests in it. For each wavelength, the optical shutters in 
only the processors at the endpoints of all light-trails on the corresponding wavelength 
are configured to be OFF; optical shutters of other processors are configured to be ON. 

As mentioned in 3], the light-trail scheduling problem is similar to the problem 
of scheduling in reconfigurable bus architectures [J, Q , and hence component coloring 
applies there too. 

The unweighted component coloring problem for C = 1, i.e., the vertex coloring 
problem, has a polynomial time algorithm on interval graphs However, the complexity 
of the problem on interval graphs for general C is not known. In this paper we give a 
polynomial time algorithm for the problem on proper interval graphs for general C . Since 
the problem arises in scheduling light-trails on path networks, we assume that an interval 
representation of the graph is also available. Our first result is the following. 

Theorem 1. Given a proper interval graph G = (V,E) with an interval representation, 
there exists an algorithm that solves the unweighted component coloring problem on G in 
0{\V\) time. 
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We also consider a splittable weighted version of the component coloring problem in 
which each vertex of the input graph has an integer weight which can be divided among 
multiple copies of the vertex and these copies can be colored separately. However, the 
total weight of a monochromatic component in the resultant graph should not exceed C. 
Again, this is motivated by a variation of the light-trail scheduling problem in which 
the bandwidth requirement of a transmission can be divided into multiple transmissions 
between the same source-destination pair. We extend the algorithm for the unweighted 
problem to solve this splittable weighted problem on proper interval graphs. So our 
second result is the following. 

Theorem 2. Given a proper interval graph G = (V,E) with an interval representation, 
there exists an algorithm that solves the splittable weighted component coloring problem 
on G in 0(\V\ 2 ) time. 

However, the (non-splittable) weighted version of the problem is NP-hard even on 
proper interval graphs. This comes from the fact that the complete graph K n is a proper 
interval graph and the weighted component coloring problem on K n is an instance of 
NP-hard Bin Packing problem We use the algorithm for the splittable weighted 
problem to get a 2-approximation algorithm for the non-splittable weighted problem on 
proper interval graphs. 

Theorem 3. Given a proper interval graph G with an interval representation, there 
exists a 2-approximation algorithm for the non-splittable weighted component coloring 
problem on G. 

The vertex coloring problem also has a polynomial time algorithm for split graphs, 
i.e., when the vertex set can be partitioned into an independent set and a clique Q. 
However, for general C, we prove that the unweighted component coloring is NP-hard 
on split graphs. So our final result is the following. 

Theorem 4. The component coloring problem is NP-hard for split graphs. 

The rest of the paper is organized as follows. We begin in Section[2]by comparing our 

work with previous related work. In Section [3] we present some pertinent definitions and 

known results. In Section 2] we show that for the class of chordal graphs, the component 
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coloring problem is equivalent to a vertex partitioning problem. Note that the interval 
graphs and the split graphs are chordal. We show in Section [5] that for the class of 
proper interval graphs, it is enough to solve a simpler version of the partitioning problem 
which we call the block-partitioning problem. We give an LP based algorithm for the 
block-partitioning problem in Section |6l We give a combinatorial algorithm for the same 
problem in Section [7] In Section [5] we extend this algorithm to solve the splittable 
weighted problem. Based on the algorithm for the splittable weighted problem we give 
a 2-approximation algorithm for the non-splittable weighted problem in Section ]§\ We 
prove the NP-hardness of the problem on split graphs in Section 1101 



2. Previous Work 

In the graph coloring literature, there are papers 0, Q to solve a problem that is a 
kind of dual to the unweighted component coloring problem. Here, the objective is to 
minimize the size of the largest monochromatic component in a coloring using a fixed 
number of colors. The paper [8] shows that for a n-vertex graph of maximum degree 4, 
there exists an algorithm that uses 2 colors and produces a coloring in which the size 
of the largest monochromatic component is O(2( 21og2 ™) / ). For a family of minor- closed 
graphs, the paper Q shows that if A colors are used, the size of the largest monochromatic 
component is in between fl(n 2 ^ 2X ^ 1 ' 1 ) and 0(n 2 ^ A+1 ^) for every fixed A. However, in 
our knowledge, there is no work in the graph coloring literature for the versions of the 
problem we formulated. 

The NP-hard light-trail scheduling problem with arbitrary bandwidth requirements 
on ring networks and general networks has generally been solved using heuristics and 
evaluated experimentally Il5| without any bound on the performance. 

For path/ring networks, the paper [3| gives an approximation algorithm that uses 0(uj + 
logp) wavelengths where p is the number of processors in the network and u> is the 
congestion, i.e., the maximum total traffic required to pass through any link. For the 
corresponding component coloring problem, p is the number of distinct end points of the 
intervals in the given interval representation, and uj is the weight of a maximum clique 
and hence a lower bound on the number of colors used. Thus the algorithm in [3| is a 
constant factor approximation algorithm with an additive term \ogp for the component 



coloring problem on interval/circular- arc graphs. Note that in general p « In. 



3. Preliminaries 

Throughout this paper, let G = (V, E) = (V(G),E(G)) be a simple, undirected graph 
and let n = \V\ and m = \E\. We also assume that G is connected. If G is not connected, 
the results in this paper can be applied separately to each of its connected components. 
The set of vertices adjacent to a vertex v e V is represented as N(v). For a set S c V, the 
sub-graph of G induced by S is G[S] = (S,E(S)) where E(S) = {(u,v) e E \ u,v e S}. 
For a set S cz V, V — S denotes G[V\5]. A clique of G is a set of pair wise adjacent 
vertices of G. The size of a clique is the number of vertices in it. A maximal clique is 
a clique of G that is not properly contained in any clique of G. A maximum clique is a 
clique of maximum size. The clique number of G, denoted by w(G) or simply ui, is the 
size of a maximum clique of G. An independent set of G is a set of pairwise non-adjacent 
vertices in it. 

A weighted graph G = (V, E, W) has a weight W(v) e Z^o associated with each vertex 
v e V. The weight-split graph of a weighted graph G = (V, E 1 , W), in short W5P(G), 
is the weighted graph G' = (V',E',W') such that the weight of each v e V is divided 
among a separate set of vertices Vi,...,v„ v in V', i.e., Yljli W'(vj) = W(v) and for 
each edge (u, v) e E there is an edge (ttj, Vj) e E' for all z = 1, . . . , n u and j = 1, . . . , n„. 
The weight-expanded graph of a weighted graph G = (V,E,W), in short WJP(G), is 
the unweighted graph G' = (V, E') such that if we put a weight 1 to each vertex in V' 
then the resulting weighted graph is a weight-split graph of G. We will use the following 
notations: n' = \V'\ and ml = \E'\. 

Coloring of a graph is an assignment of colors to its vertices. A X-assignment of 

a graph G = (V, E) is a map from V to some set of A colors such as {1, . . . , A}; this 

assignment may not be 'proper' in the standard notion of graph (vertex) coloring that 

two adjacent vertices must be assigned different colors. A color class i is the set of vertices 

assigned color i under the A-assignment. A monochromatic component of G under a A- 

assignment is a component of the sub-graph induced by a single color class, or in other 

words, a maximal connected monochromatic sub-graph. Following the terminology of 

we call a monochromatic component a chromon. The size of a chromon is the number 
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of vertices in it. For a weighted graph, the weight of a chromon is the sum of weights of 
vertices in it. 

An unweighted (weighted) graph is [A, C]-colorable if it has a A-assignment in which 
every chromon has size (weight) at most C and such an assignment is called a [A, C]- 
coloring. A C-component coloring of graph G is a [A, C] -coloring with the minimum A. 
Sometimes we will simply refer to the problem of finding a C-component coloring of a 
graph as the coloring problem. 

A weighted graph G is [X,C]-split colorable if it has a weight-split graph G' which 
is [A, C]-colorable and such a coloring is called a [A, C]-split coloring of G. A C-split 
component coloring of graph G is a [A, C]-split coloring with the minimum A. Sometimes 
we will simply refer to the problem of finding a C-split component coloring of a graph 
as the split coloring problem. 

The component coloring problem can be seen as solving two problems simultaneously, 
(i) partitioning the vertex set into chromons and (ii) assigning colors to the chromons. 
The partitioning should be such that if each part is contracted to a single vertex, the 
resulting graph can be colored using as few colors as possible. Since the size of a maximum 
clique in the contracted graph plays a major role in determining the number of colors 
used, at least for some graphs classes such as perfect graphs, we have to ensure that the 
cliques in the original graph does not intersect too many parts. We formally define the 
partitioning problem as follows: 

Definition A graph G = (V, E) is said to have a [A, C]-partition if and only if there is 
a partition II = {Pi, P 2 , . . . , P t } of V, Pi c V, P t n P, = for all i jt j such that the 
following constraints are satisfied: 

• connectedness - the subgraph induced by each part Pi, i.e., is connected, 

• size - each part Pi has at most C vertices, and 

• clique intersection — any clique in G intersects at most A parts (A will subsequently 
be called the clique intersection of the partition). 

A C-component partition of a graph is a [A, C]-partition with the minimum A. We will 
refer to the problem of finding a C-component partition as the partition problem. 
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We study the coloring problem on interval graphs and split graphs. Each of these 
classes of graphs is a subclass of the class of chordal graphs. A graph is chorda! if each 
of its cycles of four or more vertices has a chords which is an edge joining two vertices 
that are not adjacent in the cycle. There are many characterizations of chordal graphs 
(see for more details). We will use the characterization of a chordal graph based on 
perfect elimination ordering or, in short, PEO. A vertex v of G is called simplicial if its 
neighbors N{v) form a clique. An ordering a = \v\, ■ ■ ■ , v n ] of vertices is a PEO if 
each vertex Vi is a simplicial vertex of the induced subgraph G\vi, . . . , v n ]. 

Proposition 5 (Q). Let G = (V, E) be an undirected graph. Then G is a chordal graph 
if and only if G has a PEO. Moreover, any simplicial vertex can start a PEO. 

A graph G = (V, E) is a split graph if there is a partition V = S + Q of its vertex 
set into an independent set S and a clique Q. There is no restriction on edges between 
vertices of S and Q. A graph G = {V, E) is an interval graph if there exists a family 
I = {I v | v e V} of intervals on a real line such that for distinct vertices u, v in G, 
(it, v) 6 E if and only if I u n I v ¥= 0. Such a family I of intervals is commonly referred 
to as the interval representation of G. Given an interval representation of G, consider a 
cycle of more than 3 vertices, and the corresponding intervals in ascending left endpoints. 
Since the rightmost interval intersects the leftmost interval, it also intersects the intervals 
in between them. Hence G is also chordal. It will be convenient to let Left(I v ) and 
Right(I v ) stand for the left and right endpoint of the interval I v , respectively. The 
family I is the interval representation of a proper interval graph (PIG) if and only if no 
interval is properly contained in another. Interval graphs and split graphs are easily seen 
to be chordal [7|. 

Proposition 6 (Q). There exists an 0(m + n) time algorithm to get an interval repre- 
sentation of a given interval graph. □ 

However, since the component coloring problem is motivated by the light-trail schedul- 
ing problem, in this paper we will assume that an interval representation I = {I v \ v e V } 
is given for the input PIG G = (V, E). 

Now consider the linear order < on V defined as follows. For u, v e V, u < v if and 
only if Left(I u ) < Left(I v ) or [(Left(I u ) = Left(I v )) and (Right(I u ) ^ Right{I v ))). 



We call this ordering V\ < 1)% < ■ ■ ■ < v n the canonical ordering. In the rest of the 
paper, we use numbers 1 to n to represent the vertices where i represents the vertex that 
appears ith in the canonical ordering. Hence, v will be interchangeably used to represent 
a vertex v e V as well as its position in the canonical ordering. If u < v then u is said to 
be on the left of v and v is said to be on the right of u. 

Proposition 7 ([6]). A graph G = (V,E) is an interval graph if and only if there exists 
a linear order < on V such that for every choice of vertices u,v,w with u < v < w. 
(u,w) e E implies (u, v) e E. □ 

For PIGs the canonical ordering not only satisfies the conditions in Proposition[7]but, 
in fact, satisfies a stronger property: 

Proposition 8 ("The Umbrella Property" Q). A graph G = (V, E) is a PIG, if and 
only if, there exists a linear order < on V such that for every choice of vertices u,v,w, 
with u < v < w, (u,w) e E implies both (w, v) £ E and (v,w) e E. □ 



A Woc^o in a PIG a is a set of vertices which are consecutive in the canonical ordering. 
We will represent a block starting at a vertex u and ending at a vertex v as the interval 
An immediate corollary of Proposition [5] is that every edge (u, v) e E induces 
a clique Also, any maximal clique of a PIG can be represented by a single edge 

between the two end vertices, say (u, v), or by the block 

Corollary 9. Let S be a connected subgraph of a PIG and v±, t>2, . . . , v t be the vertices 
of S arranged in the canonical ordering. Then there must be an edge (vi,Vi + i) for all 
i = l,...,t- 1. 

Proof. Consider the two vertices i>i and Uj+i. Since S is connected there must be an 
edge (vj,Vk) where j ^ i and i + 1 ^ k. Then [vj, Vk] is a clique. Thus there is an edge 

(Vi,V i+ i). □ 

Proposition 10. If an interval representation is given for a PIG G, then the vertices 
of G can be arranged in canonical ordering in 0(n) time. 



Some authors use the term block to represent what we call a clique. 
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Proof. Since there are at most t < 2n endpoints of all intervals in the representation, the 
intervals can be sorted in canonical ordering using bucket sort in 0(n) time. □ 

Proposition 11. If an interval representation is given for a PIG G, then the maximal 
cliques of G can be found in 0(n) time. 

Proof. Let 1 = {I v \ v e V} be an interval representation of G. Without loss of generality 
we assume that the endpoints of all intervals are unique. Otherwise we can suitably 
extend some of the intervals on either side so that all endpoints become distinct without 
altering the maximal cliques. We construct the sorted array A(l, . . . , 2n) of all endpoints 
in 0{n) time using bucket sort. The maximal cliques are identified as follows. Traverse 
A left to right and whenever A(i) is Left(I v ) and A(i + 1) is Right(I u ) for some u, v in 
V then output [u, v]. Clearly u and v are adjacent and hence [u, v] is a clique. Since u is 
the leftmost possible and v is the rightmost possible for such a clique, [u, v] is a maximal 
clique. The traversal takes 0(n) time. □ 

4. Equivalence of Coloring and Partition on Chordal Graphs 

Lemma 12. If a graph G has a [X,C]- coloring then it has a [A, C] -partition. 

Proof. Suppose G has a [A, C]-coloring C. Consider the partition II induced by C where 
each part is exactly a chromon. The connectedness constraint is immediately satisfied. 
Since a chromon has size at most C, the size constraint is also satisfied. Since any pair 
of vertices in a clique is directly connected by an edge, the chromons in C intersected by 
a clique are all of different colors. Hence, a clique intersects at most A parts in II. Thus 
the clique intersection constraint is also satisfied. Hence, n is a [A, C]-partition. □ 

Next we will show that for chordal graphs the converse is also true. 

Lemma 13. If a chordal graph G has a [A, C]-partition then it has a [X,C]-coloring. 

Proof. Suppose G has a [A, C]-partition II = {Pi, P 2 , . . . , Pt}. We prove that there exists 

a [A, C]-coloring of G in which each Pi is a chromon. Let the colors be numbered 1,2, 

We prove by induction on number of vertices n. For n = 1, assigning color 1 to the single 
vertex gives a [A, C]-coloring for any A, C > 1. 
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For n > 1, let u be a simplicial vertex of G. Without loss of generality assume u e Pi. 
Consider the graph G" obtained by removing u from G. Then II' = {Pi\{u}, P2, ■ . . , Pt} 
is a [A, C]-partition for G' . By induction, there is a [A, C]-coloring C of G' in which each 
part of II' is a chromon. We obtain a coloring C of G as follows. If \P\ \ > 1 we assign the 
color of other vertices in Pi to u too. Otherwise we assign u the lowest numbered color 
that is not assigned to any of the neighbors of u in C . To show that C is a [A, C]-coloring, 
it is enough to show that at most A colors are used in C. For |Pi| > 1 it is obvious as no 
new color is used. For |Pi| = 1 if it requires A + 1 colors then it implies that the clique 
u u N(u) intersects A + 1 parts which is not possible. □ 

Thus, on chordal graphs, solving the coloring problem is equivalent to solving the 
partition problem. In the rest of the paper we solve the partition problem only because 
the solution can be converted to a solution to the coloring problem using the procedure 
described in the proof of Lemma 1131 

5. Equivalence of Coloring and Block-partition on PIGs 

For PIGs, we introduce a more restricted way of partitioning the vertex set. 

Definition A PIG is said to have a [A, C]-block partition if it has a [A, C]-partition in 
which each part also satisfy consecutiveness constraint, i.e., each part is also a block. 

Lemma 14. A PIG G has a [A, C]-partition if and only if G has a [A, C]-block partition. 

Proof. A [A, C]-block partition is also a [A, C]-partition. Now suppose G has a [A,C]- 
partition LI. If the parts in LI also satisfy the consecutiveness constraint, we are done. So 
assume not. We convert LI to a new partition LI' that also satisfies the consecutiveness 
constraint. The conversion is done by exchanging vertices among the parts in LI, step- 
by-step, as follows. 

We call a vertex u to be terminal if u and some v > u + 1 belong to one part but 
u + 1 belongs to a different part, non-terminal otherwise. Let Pi be the leftmost part 
whose vertices are not consecutive. Let i e Pi be smallest terminal vertex such that i + 1 
is in some Pi ^ Pi, and there exists i + k e Pi for some k > 1. We will show how to 
repartition P = Pi u P2 into parts P[ and P^ such that in the new partition, each vertex 
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in the range [1, i] is a non-terminal vertex. Then by repeating this process all vertices 
can be made non-terminal and hence consecutiveness constraint will be satisfied. Note 
that P is connected as both Pi,P2 are connected and P2 has a vertex in between two 
vertices of P\. There are two cases. 

Case 1: There are at most C vertices in P to the right of i. In this case we set P 2 
to be the vertices in P to the right of i, and the P[ to be the vertices in P to the left 
of and including i. Clearly, i is no more a terminal vertex. Since P is connected, the 
vertices of P considered in the canonical ordering form a path. P[,F% are formed by 
breaking this path in the middle, so P{,P 2 are both connected. Let Q be any maximal 
clique which intersects P[, P 2 . Since we know that the vertices of Q are consecutive, and 
i is the rightmost vertex in P[ and i + 1 the leftmost vertex in P 2 , the vertices i,i + 1 
must be in Q. Thus Q intersects Pi, Pa a s well. All other parts intersecting Q remain 
unchanged, so the number of parts intersected by Q is the same in the new partition as 
the old. 

Case 2: There are more than C vertices in P to the right of i. In this case we set P 2 
to be the C rightmost vertices in P, and the remaining go to P[. As before we see that i 
is no more a terminal vertex and P{ , P 2 satisfy the connectedness property. Consider a 
maximal clique Q that intersects P[ , P 2 . We show that it must intersect the same number 
of parts in the new partition as the old. Q must contain the rightmost vertex u of P[ 
and leftmost vertex v of P 2 . 

Note first that P[ contains both i, i + 1, i.e., it has at least one vertex from Pi and 
one vertex from P 2 . But P 2 has C vertices, so they cannot all be from Pi, or all from 
P2 because both P\,P2 had at most C vertices each. Thus P' 2 also contains at least one 
vertex j from Pi and one vertex k from P 2 . Since i,j e Pi, there must be a path in Pi 
from i to j. There must exist an edge (u',v') in this path such that v! -i, u, and v < v' 
(see Fig. [1]). Since Q is maximal, it must contain u',v'. Thus Q intersects Pi. In a 
similar manner, we see that it must intersect P 2 . Thus it follows that Q intersects the 
same number of parts in the old and new partitions. □ 

There is a simple example of a general (non-proper) interval graph where Lemma [Til 
does not work. Consider the example graph given by the intervals in canonical order- 
ing: a = [1,9], b = [2,5],c = [3,6],d = [4, 12], e = [7,10],/ = [8,11]. It has two 

12 



i = v! i + 1 u v k v' 3 

Figure 1: Sketch showing clique intersection remains unchanged 

maximal cliques Q\ = {a, b, c, d}, Qi = {a, d, e, /}. For C = 2, the optimal partition 
{{a, (i}, {6, c}, {e, /}} has clique intersection 2 but the part {a,d} is not a block as a, d 
are not consecutive according to canonical ordering. All block partitions have clique 
intersection 3 or more. The reason is as follows. If a is the only vertex in a part then to 
cover the remaining 3 vertices of Qi we need at least 2 more parts. On the other hand 
if a is paired with b then to cover the remaining 3 vertices of Q2 we need at least 2 more 
parts. 

Since for a PIG, the notions of partition and block partition are equivalent, in the rest 
of the paper we will abuse the notation [A, C]-partition to actually mean a [A, C]-block 
partition in the context of PIGs. 

Lemma 15. Given a PIG G with an interval representation, if there is an 0(f(n)) 
algorithm to solve the (block) partition problem on G, then there is an 0(n + f(n)) 
algorithm to solve the coloring problem on G. 

Proof. We first get the canonical ordering of the vertices using the procedure given in 
Proposition [TUJ Suppose the partition algorithm returns a partition with clique intersec- 
tion A and the parts sorted in canonical ordering are Pi, P2, ■ ■ ■ , Pt- For each 1 < i ^ t, 
we assign color (z — 1) mod A + 1 to Pi. This is a valid coloring because otherwise, there 
is an edge (u, v) between two parts of same color implying the clique [u, v] in G intersects 
more that A parts which is not possible in a [A, C]-partition. □ 

6. An LP Based Algorithm for Block-partition on PIGs 

Let G = (V, E) be a PIG with vertices in V already sorted in canonical ordering and 
Q be the set of maximal cliques. Let Left(Q) denote the leftmost vertex of Q. Then 
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partition problem on G can be formulated as the following integer linear program: 



IlpPart: min A 



(1) 



i+C-l 



(2) 



Le/i(Q) + |Q|-l 



s 



; < A - 1 



(3) 



j=Left(Q) + l 



Xj e {0, 1} 



1 s$ j «S n 



(4) 



A integer 



(5) 



where Xj is a binary variable to denote if vertex j is the rightmost vertex of a block and 
A denotes maximum clique intersection by any clique. Constraint ((T|) ensures that some 
block must end at n. Constraints ensure that among C consecutive vertices there 
must be at least one vertex which is the rightmost vertex of a block because a block has 
size at most C. Since a clique Q intersects at most A blocks, constraints © ensure that 
the vertices in Q, except the rightmost, can include the rightmost vertices of at most 
A — 1 blocks. The objective is to minimize the maximum clique intersection A. 

Let LpPart be the LP relaxation of IlpPart obtained by making Xj a real variable 
in [0, 1] and making A unconstrained. 

Lemma 16. If x, X is a fractional solution to LpPart then it can be rounded to a integer 
feasible solution x, A in polynomial time. 

Proof. Consider the following rounding scheme which takes 0(n) time. We use a set of 
intermediate variables yo, y±, . . . , y n . We set 



Note that each Xj is a 0-1 variable and A is an integer. Since X\ = 1, by construction 
x± = 1. Hence x satisfies constraint {TJ). 

Now we prove that x satisfies the constraints in Since x satisfies jth of such 
constraints, Xj + Xj+i + . . . + Xj+c-i 1, i.e., yj+c-i — Uj-i 1- So there must be at 




and 




for all 1 < j < n. 
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least one index k in [j,j + C — 1] such that [y^-il \yk\ implying that Xk = 1. Thus x 
also satisfies the jth constraint in @. 

Finally we prove that x, A satisfy constraints in © too. Consider the constraint for 
clique Q and let j = Left{Q). Since x satisfies this constraint, aJj+i + Xj+2 + ... + 
X i+\Q\— i ^ A — 1, i.e., — j/j < A — 1. So there can be at most A — 1 indices 

k in [j + 1, j + \Q\ — 1] such that [yfe-il # [2/fel implying that at most [AJ — 1 of the 
corresponding XkS are set to 1. Thus x, A also satisfy the constraint for clique Q. □ 

Clearly A < A. Since the integer objective value A cannot be strictly less than the 
fractional value A, A = A and hence x, A optimally solve IlpPart. Thus by solving 
LpPart and rounding the solution using the procedure given in the proof of Lemma \W\ 
gives a polynomial time algorithm for the partition problem. 

7. A Combinatorial Algorithm for Block-partition on PIGs 

We now give a combinatorial algorithm for the partition problem on PIGs. The 
algorithm does not use LP scaffolding and hence is more efficient. 

7.1. Lower Bound 

Lemma 17. If a PIG G has a [A, C]-partition then A [(u)(G) + C-1)/C\. 

Proof. Let Q be a maximum clique of G, i.e., \Q\ = u(G). To cover all vertices of Q 
by parts of size at most C, we need at least \u(G)/C~\ parts. Hence, clique intersection 
A > \u(G)/C\ = L(w(G) + C- l)/C\. □ 

There is a simple example where A = [w/C] is not enough to have a [A, C]-partition. 
Consider the graph given by the intervals {[1,3], [2,5], [4,6]}. Here w = 2. For C = 2, 
the number of parts given by lower bound [w/C] = 1 is not enough as the single part 
would contain 3 > C connected vertices. 

7.2. Upper Bound 

Lemma 18. If an interval representation is given for a connected PIG G then there 
exists an algorithm that produces a [\(u(G) + C — 1)/C], C]-partition. 
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Proof. Consider the following algorithm which we call SimplePart. We first arrange 
the vertices of G in the canonical ordering time using Proposition 1101 Then we assign 
the block of vertices [(i — 1)C + 1, min{n, iC}] to part Pj for each 1 :g i ^ [n/C]. 

Each part Pi produced by SimplePart clearly has consecutive vertices and has size 
at most C. Since G is connected, by Corollary [9l there is an edge between Vj and fj+i 
for all j = 1,2, ... ,n — 1. Hence Pi is also connected. Thus it will be enough to show 
that any clique intersects at most A = \(oj + C — 1)/C] parts. 

If a clique Q intersects A parts P,-, . . . , Pj + a_i, then Q must contain at least one 
vertex of each of P, and P^+a-i and all vertices of remaining parts P%, . . . , P^+a-2- So 
the minimum size of Q is 1 + (A - 2)C + 1 = AC - 2C + 2. Thus u > AC - 2C + 2. Hence 
A s: (lj + 2C - 2)/C. This implies A s£ [(w + 2C - 2)/CJ = + C - 1)/C|. □ 

There is a simple example where SimplePart does not give the optimal partition. 
Consider the graph given by the intervals a = [1, 6], b = [2, 7], c = [3, 10], d = [4, 11], e = 
[5,12],/ = [8, 13], g = [9,14]. It has two maximal cliques {a, b, c, d, e}, {c, d, e, /, g} and 
w = 5. For C = 3, SimplePart produces the partition {{a, b, c}, {d, e, /}, {g}} which 
has clique intersection 3. But there exists a better partition {{a, &}, {c, d, e}, {/, g}} with 
clique intersection 2. 

However, a close analysis reveals that SimplePart is not that bad. In fact, when 
uj{G) = kC + 1 for some integer k, the two bounds match and hence SimplePart gives 
the optimal solution. Again for other values of w(G), which can be represented as kC + r 
for integer k, r such that 2 < r $5 C, the two bounds are k + 1 and k + 2 respectively, 
hence differ by 1 and one of the two bounds is optimum. Thus it will be enough to solve 
the following special case of the problem. 

Definition Partition subproblem: given a PIG G with uj(G) = kC + r, k integer and 
2 < r < C, check if there is a [k + 1, C]-partition and if so, generate the partition. 

If there is an algorithm Alg for the partition subproblem then we apply Alg to G to 
check if there is a [k + 1, C]-partition. If yes, Alg also gives the required partition. 
Otherwise, SimplePart gives an optimal solution. 

In the rest of the paper we will let k(G), or in short k, denote [(w(G) — 1)/CJ. 
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7.3. Forbidden vertices 

The key idea in our algorithm is to first identify those vertices that cannot be right 
cndpoints of a block in a possible [k + 1, C]-partition. 

Definition A vertex i in a PIG is said to be primarily forbidden if the block [i — kC, i + l] 
is a clique. 

Lemma 19. If the vertex i in a PIG is primarily forbidden then no block in a[k + 1, C]- 
partition can end at i. 

Proof. Suppose a block in a [k + 1, C]-partition ends at the vertex i. Consider the clique 
[i — kC, i+l] which must be covered by at most k + 1 blocks. To cover the vertex i+l 
we need one block. Then the remaining kC + 1 vertices [i — kC, i] must be covered by 
at most k blocks. This is not possible as the size of a block is at most C. □ 

Definition A vertex in a PIG is forbidden if it is primarily forbidden or secondarily 
forbidden, where secondarily forbidden vertices are defined recursively as follows. If (a) 
there exists a block of forbidden vertices [v — s + 1, v] where 1 < s < C — 1, and (b) the 
block [v — kC, v — s + 1] is a clique then the set of vertices P(v) = {v — qC | 1 s$ q < k) is 
secondarily forbidden. Further, we will say that v is the leader of all secondarily forbidden 
vertices in P(v) and v itself. Similarly each secondarily forbidden vertex in P(v) is a 
follower of v. 

Note that a primarily forbidden vertex is the leader of itself. Furthermore, any 
forbidden vertex i has a leader i + qC where q is an integer and ^ q s$ k. 

Lemma 20. If the vertex i in a PIG is secondarily forbidden then no block in a [k + 1, C]- 
partition can end at i. 

Proof. The leader of i is i + qC where 1 q k. Let Q be the clique [i + qC — kC, i + qC — 
s + 1] . Since the vertices [i + qC — s + l,i + qC] are forbidden, they must be covered by a 
single block, say B. The block B must end at a vertex on the right of i + qC. In the best 
case B ends at i + qC+1 and covers C — s vertices of Q, i.e., [i + (q— l)C+2, i + qC— s + 1]. 
The remaining (k — 1)C + 2 vertices [i — (k — q)C, i + (q— 1)C + 1] of Q must be covered 
by at most k blocks. 
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Now suppose a block of a [k + 1, C] -partition ends at the vertex i. To cover the 
(k — q)C + 1 vertices [i — (fc — q)C, i] we need at least (k — q) + 1 blocks. The remaining 
(q — 1)C + 1 vertices [i + 1, i + (q — 1)C + 1] must be covered by at most q — 1 blocks. 



Our algorithm is as follows. We mark all forbidden vertices, and then try to form 
blocks by a greedy left to right strategy. 

7.4- Marking forbidden vertices 

The algorithm for marking forbidden vertices is given in Algorithm [TJ The algorithm 
assumes that we are given an array Lmn, where Lmn(i) denotes the leftmost neighbor of i. 
It is easily seen that Lmn can be computed in 0(n) time given an interval representation 
of the input graph. The algorithm constructs the array F, where F(i) = 1 if and only if 
i is forbidden. 

Algorithm 1: M ARkForbidden 
Input : Lmn(l, . . . , n) for a PIG G = (V, E) 

Output: F(l, ...,n) 

1 foreach i = 1 to n do F(i) = Ldist(i) = 0; 

2 foreach i = n downto 1 do /* phase 1 */ 

3 if Lmn(i) i — kC — 1 then 



5 Rnf {n) = n; /* Rnf (i) is rightmost non-forbidden vertex j where j ^ i */ 



This is not possible as the size of a block is at most C. 



□ 



4 



F(i-1) = 1; 



6 foreach i = n — 1 downto 1 do 



/* phase 2 */ 



8 



9 



7 



Rnf (i) = min{i,Rnf(i + 1)}; 

while F(Rnf (i))==l do Rnf (i) = Rnf (i) - 1; 

if (F(i) = = l) and (Ldist(i) ^ (fc - 1)C) then 



/* i is a follower */ 



/* Extend Rnf(z) */ 
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F(i-C) = l; Ldist(i - C) = Ldist(i) + C; 



n 



12 



if Lmn(Rnf (i) + 1) i - kC then 
F(i - C) = 1; Ldist(i - C) = C; 



/* i is a leader */ 
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In phase 1, the primarily forbidden vertices are marked. We check at each vertex i if 
there is a clique [i — kC — 1, i] of size kC + 2, that is, if Lmn(i) < i — kC — 1, then we 
mark the primarily forbidden vertex i — 1 by setting F{i — 1) = 1. 

In phase 2, we mark the secondarily forbidden vertices. It is enough to identify all the 
leaders because their followers are all the secondarily forbidden vertices. We identify the 
leaders and mark their followers in an interleaved manner in a single traversal through the 
vertices. If i is identified as a leader, we mark its rightmost follower i — C immediately, 
we mark the second rightmost follower i — 2C when we visit i — C, and so on. 

Note that the rightmost vertex n is never forbidden and hence is not a leader. So in 
phase 2, we visit each vertex i starting with the second rightmost. We check if the vertex 
is a follower of a already discovered leader. This is easy to do, we merely check if i is a 
follower of a vertex that is not far, i.e., at most a distance (k — 1)C from i. Then we 
mark i — C '. For this we maintain the auxiliary array Ldist(l, . . . , n) where Ldist(i) is 
the distance of i from its leader if i is secondarily forbidden, unspecified otherwise. Then 
we also check if i is itself a leader. The condition for i being a leader is that there should 
exist a block of forbidden vertices and a clique [i — kC,j] where i — j + 1 < C. 

However, as following Lemma shows, we can assume without loss of generality that the 
block of forbidden vertices is left maximal. 

Lemma 21. In a PIG the vertex i is a leader if and only if the left maximal forbidden 
block at i is [j, i] and the block [i — kC, j] is a clique. 

Proof. [<=] If the block [i — kC, j] is a clique and [j, i] is a forbidden block then clearly 
i is a leader. 

[=>] If i is a leader then there exists j' such that < i — j' < C — 1, the block [j', i] is 
forbidden and the block [i — kC,j'] is a clique. If [j', i] is not left maximal then let [j, i] 
be the left maximal forbidden block at i. Then j < j' . Hence, [i — kC 7 j] is a subclique 
of [i-kC,f]. □ 

Now, checking if a vertex is a leader is easy. We need to know the leftmost endpoint 
j of a block of forbidden vertices ending at i, and whether a clique starting at i — kC 
ends at j. The leftmost endpoint of the forbidden block need not be calculated afresh for 
every i. If the leftmost endpoint j of the forbidden block ending at i is already calculated, 
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then wc only need to check if the forbidden block extends further on the left of j, when 
considering the vertex i. For this we maintain the auxiliary array Rnf (1, . . . ,n) where 
Rnf (i) is the rightmost non-forbidden vertex such that Rnf (i) ^ i. The elements of Rnf 
can be recursively computed as follows: Rnf (i) is the rightmost non- forbidden vertex x 
such that the block [x + 1, min{i, Rnf (i + 1)}] is forbidden. To make sure the existence 
of Rnf (i) for all 1 < i < n, We assume without loss of generality an imaginary vertex 
numbered that is not forbidden. 

Lemma 22. If the array Lmn for a PIG G is given, the algorithm MarkForbidden 
correctly marks the forbidden vertices of G in 0(n) time. 

Proof. It is easy to see that phase 1 of MarkForbidden correctly computes correct 
values of F for primarily forbidden vertices. Let us refer each iteration of the loop 
in phase 2 by the corresponding value of i. We now claim that, at the beginning of 
iteration i in phase 2, the array F contains correct values for all vertices in the range 
[max{l,i — C + l},n]. This in turn proves the correctness of MarkForbidden. We 
show by induction on i. 

For i = n there cannot be any secondarily forbidden vertex in the range \i — C + 1, n] . 
Hence the claim is trivially true. For i < n assume at the beginning of iteration i, the 
array F contains correct values in the range [i — C + l,n] (for simplicity we assume 
i > C + 1, the cases i =g C can be shown similarly). The vertex i — C is a secondarily 
forbidden vertex if and only if it is either the right most follower of i or it is a follower of 
some vertex on the right of i. In MarkForbidden we handle the second case first and 
update F accordingly. After adjusting Rnf (t) suitably, [Rnf (i) + 1, i] correctly denotes the 
left maximal forbidden block at i because by induction hypothesis the forbidden vertices 
in the range [i — C + 1, i] are already marked. If there is a clique [i — fcC, j + 1] then i is 
a leader and we update F for the rightmost follower i — Coii. Hence MarkForbidden 
correctly computes if i — C is a forbidden vertex in iteration i. Thus at the end of 
iteration i, i.e., at the beginning of iteration i — 1, the array F contains correct values 
in the range [(i — 1) — C + 1, n\. 

The pseudocode of Algorithm [1] clearly shows that MarkForbidden takes overall 
0(n) time. □ 
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7.5. Algorithm CombPart 

We now give our algorithm to solve the partition subproblem. The algorithm first 
marks all forbidden vertices and then forms blocks greedily such that no block ends at a 
forbidden vertex. We call this algorithm CombPart, which is shown in Algorithm^ 

Algorithm 2: CombPart 
Input : A PIG G = (V, E) 

Output: If G has a [k + 1, C]-partition; if Yes also output such a partition 

1 F(l, ... ,n) = array returned by MarkForbidden on G; u = 1; 

2 while ii ^ i! do 

3 v = min{u + C — 1, n}; 

4 while (v > u) and (F(v) = = 1) do v = v — 1; 

5 if v < u then return No else create part [u, v]; U = v + 1; 

6 return Yes; 



Lemma 23. Let B be any block, except the rightmost, created by CombPart. Let u be 
the leftmost vertex of B. Then the block of vertices [u + jC + \B\,u + jC + (C — 1)] are 
forbidden for < j < k — 1. 

Proof. If \B\ = C then the block [u + jC + \B\ , u + jC + (C - 1)] is empty and hence the 
lemma is vacuously true. So we assume \B\ < C. Note that u + \B\, . . . ,u + C — 1 are all 
forbidden because otherwise CombPart would have created the block B of bigger size. 
Also note that either B is the leftmost block or u — 1 is the rightmost vertex of a block. 
So without loss of generality we assume that u — 1 is not forbidden. 

It will be enough if we prove that for each \B\ < t C — 1 the vertex u + kC + t is a 
leader because then its followers P(u + kC + £), i.e., u + t, u + C + <,..., u + (k — 1)C + i 
are all forbidden. We prove by induction on t. 

Base case: t = C — 1. Since w + C — 1 is forbidden, its leader is the vertex v = 
u + C — 1 + qC for some < q < and the followers of u, the vertices in P(v), arc 
forbidden. But u — 1 is not forbidden, i.e., u — 1 $ P{v). Hence u — 1 < v — kC ', implying 
q > k — 1. Thus q = k, which implies our claim. 
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Induction case: suppose the claim is true for t = t' where \B\ < t' =S C — 1, i.e., the 
vertex u + kC + t' is a leader. We need to prove that u + kC + t' — 1 is also a leader. 

Since u+t'—l is forbidden, its leader is the vertex v = u+t'— 1+qC for some s$ q < k. 
Thus, if q = k then we are done. So assume that q < k, i.e., q = k — z,l =g z =g k. We 
will show that this leads to a contradiction. 

Since v is a leader, by Lemma 1211 for some vertex x, the block F\ = [x, v] is the 
left maximal forbidden block at v and the block Q = [v — kC, x] is a clique. Again, by 
induction hypothesis, each of the vertices [u + kC + t'.u + (k + 1)C — 1] is a leader. 
Thus, the set of vertices F 2 = [u + qC + t' ,u + (q + 1)C — 1] is forbidden. But can 
be rewritten as [v + 1, u + (q + 1)C - 1]. Thus F = Fx u F 2 = [x, u + (q + l)C - 1] is a 
left maximal forbidden block at u + (q + 1)C — 1. By applying Lemma |2"T1 to Q and F, 
the vertex u + (q + 1)C — 1 is a leader. Among its followers, P(u + (q + 1)C — 1), the 
zth from the left is u + (q — k + z)C — 1 = u — 1. This is a contradiction because u — 1 
is not forbidden. □ 

Lemma 24. Suppose CombPart creates consecutive blocks B±, . . . ,Bk+i- Let u be the 
leftmost vertex of B\. Let Sj = C — \Bi\ and Si = Y^j=i s j- Then the Sk consecutive 
vertices [u + kC — Sk,u + kC — 1] are all forbidden. 

Proof. For 1 < i ^ fc, let m be the leftmost vertex of block Bi. Note Ui = u + (i — 1)C — 
Si-i where = 0. 

Applying Lemma [23] to Bi for all 1 < i < k and considering the set of consecutive 
forbidden vertices f$ corresponding to j = k — i + 1 we get Fi = [ui + (k — i + 1)C — 
Si,Ui + {k — i+ l)C - 1] = [u + kC - S l ,u + kC + Si-i - 1]. 

The set yj\ =k Fi is indeed the required set of consecutive forbidden vertices. □ 

Lemma 25. If an interval representation for a PIG G is given then CombPart correctly 
solves the partition subproblem on G in 0(n) time. 

Proof. If CombPart outputs NO, then there is a set of C consecutive forbidden vertices. 

To cover these vertices we need a block of size at least C + 1. So there cannot be any 

valid partition. Hence CombPart is correct. 

Now we prove that if CombPart outputs Yes then the partition generated is a valid 

partition. Since the algorithm generates blocks of size at most C, the size constraint 
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is satisfied. We only need to prove that no clique intersects more than k + 1 blocks 
generated by CombPart. We prove this by contradiction. 

Suppose there is a clique Q that intersects k + 2 blocks Bo, Bx, ... , Bk+x- Without loss 
of generality, we assume that only the leftmost vertex of Q is covered by Bo and only the 
rightmost vertex of Q is covered by Bk+x- Because, otherwise we can take a sub-clique 
Q' c Q with this property. Also let v be the leftmost vertex of Q, i.e., rightmost vertex 
of Bo and hence not forbidden. Let Sj = C — \Bi\ and Si = 2j=i s j f° r 1 ^ i ^ k. Let u 
be the leftmost vertex of B\ . 

Note \Q\ =g kC + 1 because otherwise v would be forbidden. By Lemma HM1 there 
are Sk consecutive forbidden vertices. Since the algorithm outputs Yes, Sk < C. Hence 
IQI = 2 + Zti 1^1 = 2 + ELi(<? - s *) = 2 + fcC - S k > (k - l)C + 2. 

So Q has size kC + 2 — Sk where 1 < Sk ^ C — 1 and by Lemma [24J there are 5^ 
consecutive forbidden vertices [u + fcC — 5fe, u + kC — 1] = [v + kC + 1 — Sk, v + kC]. 
By Lemma [2^1 v is forbidden. It is a contradiction. 

If an interval representation is given then we can easily find the maximal cliques and 
hence can compute the array Lmn in 0(n) time. By Lemma [221 the marking of forbidden 
vertices takes time 0(n). The greedy procedure for generating the parts also takes 0{n) 
time. Overall time taken is 0{n). □ 

Combining Lemma IT5l Lemma [231 and the discussions at the end of the subsection !7.2l 
we get a proof of Theorem [TJ 

8. Algorithm for Splittable Weighted Problem on PIGs 

Lemma 26. A weighted graph G = (V, E, W) is [A, C]-split colorable if and only if 
WXP(G) is [X,C]-colorable. 

Proof. Let G' = (V',E') = WXP{G). 

\^=\ The weighted graph G", obtained by putting weight 1 to every vertex of G', 
is also a weight-split graph of G. So if G' is [A, C]-colorable then G" and G both are 
[A, C]-split colorable. 

[=>] Let the weighted graph G"(V", E", W") be the weight-split graph corresponding 
to the [A, C]-split coloring of G. Then G' is the weight-expanded graph of G" too. A 
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[A, C]-coloring of G' can be obtained by assigning the vertices in G' corresponding to a 
vertex v" in G" the same color of v". □ 

Thus solving the split coloring problem on a weighted PIG G = (V, E, W) is equivalent 
to solving the unweighted coloring problem on WX P(G). In the rest of the section we will 
use G' = (V, E') to represent W XP(G). Applying the algorithm described in Section[7] 
on G' gives correct result but it makes the algorithm pseudo-polynomial as it takes O(n') 
time, proportional to the sum of weights. This is mainly because the algorithm iterates 
over each vertex in G' . 

However, it turns out that iterating over each vertex in G' is not necessary. The 
forbidden vertices in G' can be divided into blocks such that if the vertices u and v are 
in the same block b then leader of u and leader of v are in the same block I. We call such 
forbidden blocks FBs. Parallel to the vertices, we say that FB I is the leader of FB b and 
b is the follower of /. It can be seen that all vertices in an FB can be marked together. 
Hence it is enough to iterate through the FBs instead of iterating through the vertices 
of G'. 

8.1. Marking forbidden blocks 

We now modify the algorithm presented in Section [7] to let it work with FBs in- 
stead of forbidden vertices. The modified algorithm to mark all the FBs, which we call 
SplitMark, is shown in Algorithm [31 

We use the following correspondence between a vertex v e V and a vertex v' e V . 
The vertex v' = h(v,q) if v' is the qth copy of v where 1 ^ q ^ W(v) and v = h(v') if 
v' is a copy of v. The set {h(v, 1), . . . , h(v, W(v))} of copies of v is represented by H(v). 
As usual, we will interchangeably use 1 < v' < n' (1 < v < n) to denote a vertex v' e V' 
(v e V) as well as its position in the canonical ordering of vertices in G' (G). We also 
use an auxiliary array Z(0, . . . , n) such that Z(0) = and for all v > 0, the entry Z(v) 
denotes the rightmost copy of v in G', i.e., h(v, W(v)). Since all the copies h(v, t) e V' of 
v e V appear consecutively in the canonical ordering of G', we have Z(v) = 2<=i W(i). 
Note that given Z, the values of the function h(u) for all u belonging to a subset of 
vertices S E V', can be computed in right to left order, in overall 0(151 + n) time. 
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Algorithm 3: SplitMark 



Input : Maximal cliques of a PIG G, Lmn(l, . . . , n), Z(0, . . . , n) 
Output: Doubly linked list of FBs F in G' = WXP{G) 
1 foreach maximal clique [u,v] in G do /* phase 1 */ 



if Z(v)-Z(u-l)-{kC+2) > then /* [Z{u-1)+1, Z{v)\ e G' = [u, v] e G */ 
F.Inlay(Z(w) - 1, Z(v) - Z(u - 1) - (kC + 2) + 1, 0); 



4 ! = F.end^prev; /* F.end^prev is the rightmost FB */ 

5 while i F.begin do /* phase 2 */ 



6 
7 
8 
9 
10 

11 
12 
13 
14 

15 



v = bright; 

j = i^rnf = j^next^rnf ; if v < i^rnf aright then j = i^rnf = i; 
while j^prev^right == j^right — j^size do i^rnf = j; j = j^prev; 
if i->ldist ^ (k — 1)C then /* i is a follower block */ 

i 7 '.Inlay(w — C, i^size, i^ldist + G); 

u = i->rnf aright — i^rnf -^size + 1; / = v — u + 1; 

s= (v— Lmn(/i(it)) + l) — (fcG+2); /* function h is computed using Z */ 
if / + s ^ then /* new leader block ending at v */ 

F.Inlay(u-G,/ + s + l,G); 

i = i^prev; 



We store the information about the FBs in a linked list. Thus F is a now a doubly 

linked list of non-intersecting FBs sorted according to canonical ordering. We also keep 

the information stored in auxiliary arrays Ldist and Rnf earlier, in the list F itself. Thus 

each entry b of F has the following fields: (i) right denotes the rightmost vertex of the 

FB 6, (ii) size denotes the size of 6, and (iii) ldist denotes the distance of 6. right from 

its leader, (iv) rnf points to the leftmost FB such that all FBs between b. rnf and b are 

consecutive, i.e., all vertices in [6.rnf->right — fe.rnf^size + 1,6. right] are forbidden, 

(v) prev points to the FB on the left of b, and (vi) next points to the FB on the right of 

b. Note that we use the notation p^q to represent the field q of the FB pointed by the 

pointer p. We keep two sentinel FBs in F always, (i) the leftmost FB [—2, —1] and (ii) 

the rightmost FB [Z(n) + 2, Z(n) + 3] each having ldist = and rnf pointing to itself. 
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Two pointers Kbegin and F.end point to these two FBs, respectively. 

In addition to the standard operations of insert, delete and both way traversals though 
the list, we define a new operation on F which we call F. Inlay (rt, sz, Id). This operation 
inserts a new FB b with b. right = rt, 6. size = sz, 6.1dist = Id into F but makes sure 
that the FBs in F remain non-intersecting and sorted. Let b\ , . . . , b s be the FBs in F 
which intersect b. The operation Inlay does the following: (i) deletes all the FBs in 
F which are subsets of b, (ii) if b partly intersects 61, i.e., if t = (b. right — 6. size — 
61. right) < 61. size then updates 61. size = b±. size — t, 61. right = 61. right — t, (iii) 
if b partly intersects b s , i.e., if b s . right > b. right then updates 6 s .size = b s . right — 
b. right, and (iv) inserts b at its proper position in F. 

We slightly modify the definition of the array Lmn(l, . . . , n). Now Lmn(u) denotes the 
leftmost neighbor of Z(v) in G . If the maximal cliques of G are given, then the elements 
of Lmn can be computed in 0(n) time. 

In phase 1 we mark the FBs due to the primarily forbidden vertices given by the 
following lemma: 

Lemma 27. Let a PIG have a maximal clique Q with rightmost vertex v and let s = 
\Q\ — (kC + 2) ^ 0. Then the set of all primarily forbidden vertices in Q is exactly the 
FB [v-s- l,v- 1]. 

Proof. Let u be the leftmost vertex of Q. The only subcliques of Q which satisfy the 
conditions of Lemma [T51 are Q p = [u + p, u + kC + 1 +p] for all < p < s. The primarily 
forbidden vertex for Q p is u + kC + p. Thus the set of all phase 1 forbidden vertices for 
Q is {u + kC + p, sS p sS s}, i.e., the FB [v - s - l,v - 1]. □ 

So in phase 1 we go through the maximal cliques of G' which have one-to-one cor- 
respondence with the maximal cliques in G and mark the FBs for the primarily for- 
bidden vertices. Note that the maximal clique [u, v] e G corresponds to the clique 
[Z(u-1) + l,Z(v)] e G 1 . 

It is clear that in phase 2 we need not check for leaders at the vertices which are 
not forbidden; checking only at the rightmost vertex in each FB suffices. The following 
lemma determines the size of the leader block ending at the rightmost vertex in a FB. 
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Lemma 28. Let a PIG have a left maximal block B of forbidden vertices with the right- 
most vertex v and size f where 1 ^ / < C . Let Q be the largest clique with the rightmost 
vertex v — f + 1. Let s = \Q\ — (kC + 2). Then the leaders in B are exactly the FB 
[v - f - s,v]. 

Proof. Let u be the leftmost vertex of Q. Thus u = v — / + 1 — |<3| + 1 = u — kC — (/ + s). 
Consider any vertex v' e B. If / + s < then u > v — fcC 5= v' — kC. Since Q is the 
largest possible there cannot be a clique [v' — kC, v — f + 1] . Hence by Lemma [2TJ the 
vertex t/ is not a leader. So we assume / + s > 0. 

Note that s < because otherwise v — f would be a primarily forbidden vertex on 
the immediate left of B which is not possible as B is left maximal. Thus [v — f — s, v] 
is a subset of B. Now for all < p / + s, consider the vertex v p = v — p. Since 
v p — kC = v — kC — p > u, the block [v p — kC, v — f + 1] is a subclique of Q. Thus 
by Lemma |21[ the vertex v p is a leader. Since Q is the largest possible, [v — f — s,v] is 
exactly the set of leaders in B. □ 

Thus in phase 2 we visit each FB i starting with the rightmost FB created in phase 1. 
We check if i is a follower of some previously discovered FB I at a distance at most 
(k — 1)C and mark the next follower of I on the left of i. We also check if i is a leader 
itself using Lemma [551 If a leader FB I is identified then we insert the rightmost follower 
of I, and so on, similar to secondarily forbidden vertices in Section [7] 

Lemma 29. // the arrays Z, Lmn and the maximal cliques of a PIG G are given, then 
SplitMark correctly marks the FBs of G' = WXP(G) in 6(n 2 ) time. 

Proof. It is easy to see that phase 1 of SplitMark correctly inserts into F the FBs due 

to the primarily forbidden vertices while ensuring that the FBs in F are non-intersecting 

and sorted in canonical ordering. Let us refer each iteration of the loop in phase 2 by the 

corresponding value of i. We now claim that, at the beginning of iteration i in phase 2, 

the list F correctly contains all FBs in the range [max{l, bright — C + 1}, n']. This in 

turn proves the correctness of SplitMark. We show by induction on i. 

For i pointing to the rightmost FB there cannot be any secondarily forbidden vertex 

in the range [i->right — C + l,n\. Hence the claim is trivially true. For other values 

of i, assume at the beginning of iteration i, the list F correctly contains FBs in the range 

27 



[z->right — C + 1, n] (for simplicity we assume i-»right > C + 1, the cases i->right sg (7 
can be shown similarly). In iteration i SplitMark correctly inserts a new FB [i->right— 
i^size — C + 1, i->right — C] or [i-s-right — (/ + s + 1) — C+ 1, i-s-right — C] depending 
upon whether (i) FB i is a follower of a previously discovered FB or (ii) there is a 
leader FB with rightmost vertex bright, given by Lemma 1281 In case (i) there are 
non forbidden vertex in [&->right + 1, bright — i^slze]. Hence there can not be any 
secondarily forbidden vertex in the [&->right — C + 1, i->right — i->size — C]. Thus at 
the end of iteration i, the list F correctly contains FBs in the range [6->right — C + l, n']. 
Similarly in case (ii) there can not be any new secondarily forbidden vertex [bright — 
C + 1, bright — (/ + s + 1) — C] and hence at the end of iteration i, the list F correctly 
contains FBs in the range [6->right — C + l,n']. At the end of iteration new value of i 
is b. Hence our claim is true at the beginning of the next iteration too. 

Note that for each FB the algorithm takes 0(1) time except the operation Inlay. 
Note that since the operation Inlay is invoked with FBs in right to left order, it can be 
implemented by maintaining an extra pointer that traverses through the FBs in right to 
left order, in overall 0(|-F|) time. If the weight of each vertex is at most then the set 
of vertices H(v) for v in G can contain at most one follower of each leader FB. There can 
be as many leader FBs as the number of maximal cliques in G", i.e., at most n. Hence 
\F\ = 0(n 2 ). Thus, time complexity is 0(n 2 ). 

Now we show that there is a class of PIGs for which SplitMark takes tt(n 2 ) time. 
The class of PIGs is obtained by varying some parameter t. A PIG G in this class has 
n = it vertices given by the intervals I\ , . . . , I%t where for 1 < j < t + 1 the interval 
Ij = [j,2t + 2i— 1] has weight 2, for 2 < j t the interval I t+J = [t + l,4i + j] has 
weight 2t, and again for 1 ^ j < t the interval ht+j = [2t + 2i, 5t + j] has weight 2. 
Clearly G' = WXP(G) has (t + 1) * 2 + (t - 1) * 2t + t * 2 = 2(t 2 + t + 1) vertices and 
each of the t + 1 maximal cliques [2j — l,2j + 2t 2 ], 1 < j < t + 1, has size 2t 2 + 2. 
For C = 2t, we have k(G') = t. In phase 1 SplitMark creates t + 1 FBs each having 
a single vertex 2j + 2t 2 — 1 for all 1 s£ j t + 1. In phase 2 SplitMark creates a 

2 If the weights are unrestricted, then we can still solve the decision version of the problem in 0(n 2 ) 
time by considering only the interesting FBs that fall within the set H (v) for a vertex v in G. We omit 
the details here. 
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FB from each of the remaining odd numbered vertices in G' . Thus the total number of 
FBs created by SplitM ARK on G is equal to the number of odd vertices in G' . Hence 
\F\ = t 2 + t + l = (n/3) 2 + rj/3 + l = n(n 2 ). Thus SplitMark takes fJ(ri 2 ) time on G. □ 

8.2. Algorithm SplitPart 

We now give the modifications to CombPart to use the FBs. We call this modified 
algorithm Split Part, which is shown in Algorithm [4] 

Algorithm 4: SplitPart 
Input : A PIG G = (V, E, W) 

Output: If G'=WXP{G) has a [fc+1, C]-partition; if Yes also output the partition 

1 F = list of FBs returned by SplitMark on G; u = 1; i = f.begin^next; 

2 while u =g n' do 

3 v = min{?! + C — 1, n'}; 

4 while i->right < v do i = i->next; 

5 v = i->right — i->size; 

6 if v < u then return No else create part [u,v]; u = v + 1; 

7 return Yes; 



Lemma 30. // an interval representation for a weighted PIG G is given then SplitPart 
correctly solves the partition subproblem on WXP(G) in 0(n 2 ) time. 

Proof. Given that SplitMark correctly marks the forbidden blocks of G' = WXP(G), 
it is easy to see that SplitPart generates the same partition that CombPart would 
have generated on G' . Given an interval representation of G, Z, Lmn and maximal cliques 
of G can be computed in 0(n) time. Thus by Lemma [^91 computing F takes 0(n 2 ) time. 
The block generation step also takes 0(\F\) = 0(n 2 ) time. □ 

Combining Lemmas [15j [251 EHl an d the discussions at the end of Subsection 17.21 we 
get a proof of Theorem [5] Note that SimplePart can be slightly modified to use vertices 
in WXP(G) but still taking Q(n) time. 
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9. A 2-approximation Algorithm for Weighted Problem on PIGs 

Lemma 31. There exists a polynomial time algorithm for the non-splittable weighted 
partition problem that generates a [A, C]-partition on a PIG G such that A is at most 2 
times the clique intersection of the partition generated by an optimal algorithm on G. 

Proof. We first solve the corresponding splittable weighted problem on G in 0(n 2 ) time 
using the algorithm described in Section[3] Let the blocks in the [A', C]-partition created 
by the algorithm be V 1 = {Pi, P%, . . . , Pf}. Note that A' is a lower bound on the clique 
intersection A* of the partition generated by any optimal algorithm on G. 

Since the weight of a vertex in a non-splittable problem is at most C, a vertex of 
G is split into at most two consecutive blocks Pi and Pj+i. We convert the splittable 
partition V' into a non-splittable partition V in 0(n) time as follows. Consider each 
vertex v left to right. If v is split into blocks Pi,Pi + i and v cannot be put completely 
in Pi then create a copy P[ of Pj, insert P/ in between Pi and Pj+i, put v completely 
in P/ and repeat with the rest of the vertices. Note that V contains at most 2 copies of 
each block Pi and hence V is a [A, C]-partition with clique intersection A ^ 2A' < 2A*. 
Overall it takes 0(n 2 ) time. □ 

Combining Lemma [T51 and Lemma |3H we get a proof of Theorem [31 

10. Partition Problem on Split Graphs 

Since split graphs are also chordal, solving the partition (not block-partition) problem 
is enough. It can be noted that the same lower bound of Lemma [T71 applies here too. 

10.1. Upper Bound 

Lemma 32. Let u) be the clique number of a split graph G. There exists a polynomial 
time algorithm that gives a [|w/C] + 1, C]-partition for G. 

Proof. Let the vertex set of G be split into clique Q and independent set S. Without 
loss of generality, we assume that Q is a maximum clique. Because otherwise \Q\ = uj — 1 
and we can move a vertex in S that is adjacent to all vertices in Q to Q. Now consider 
the following partition of vertices: LT = {Pi, P2, . . . , Pt} u {{f}|f e S} where t = |w/C], 
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and {Pi]\ =l is an arbitrary partition of Q such that |Pj| = C for all i = 1, . . . , (t — 1). 
Note that this partition can be created in polynomial time. Each part is connected and 
has at most C vertices. Moreover, any maximal clique in G intersects at most t + 1 parts. 
Thus, IT is a [\oj/C\ + 1, C]- partition. □ 

10.2. NP-hardness 

Since the upper bound and the lower bound differ by 1, it is enough to decide if G has 
a [[cj/C], C]-partition or not. If the answer is Yes then we have an optimal solution to 
the partition problem with clique intersection A = \w/C\. Otherwise the partition given 
in the proof of Lemma l32l gives an optimal solution with clique intersection A = [w/C] + 1. 
Thus Lemma [33J directly gives a proof of Theorem 

Lemma 33. The problem of deciding if a split graph G has a [\uj(G)/C],C]-partition 
for C ^ 2 is NP-complete. 

Proof. We show that the decision problem is NP-complete even for C = 2. We call the 
problem in this special case as CP. First we show that CP is in NP. A maximal clique 
in G is either Q or the closed neighborhood of a vertex in S. So the maximal cliques in 
G can be found in polynomial time. Suppose a partition of the vertices is given. Size 
constraints can be easily checked. Each part contains a single vertex or a pair of vertices. 
A single vertex is trivially connected. Connectedness of a part of size 2 can be checked by 
just checking if there is an edge between the two vertices. Clique intersection constraint 
can also be checked in polynomial time. 

We now introduce a set partitioning problem (SP) is defined as follows. Given a set of 
2n elements e\, e%, . . . , ei n and a collection of m subsets S\, S2, ■ ■ ■ > S m , can the elements 
be partitioned into n groups of size 2 such that each subset has both elements of at least 
one group? 

We complete the proof by first showing a polynomial time reduction from SP to CP 
(Lemma I34[) and then a polynomial time reduction from the well known NP-complete 
problem SAT to SP (Lemma EH)- □ 

Lemma 34. SP s= P CP. 
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Proof. Given an instance of SP, we construct an instance of CP as follows. The complete 
set Q has a vertex Vi corresponding to each element ej and the independent set S has a 
vertex Wj corresponding to each subset Sj. There is an edge between Vi and Wj if and 
only if e, $ Sj. Clearly uj = 2n and hence \uj/C] = n. 

Suppose there is a Yes solution to the SP instance where the groups are G\, G2, ■ ■ ■ , G n 
Then create a partition n = {P\, P2, . . . , P n }u {wj}™ =1 for the CP instance where e Pi 
if and only if et e Gi. Clearly each part is connected and has at most 2 vertices. Clique 
intersection constraint is satisfied for Q. Since Sj contains both elements of at least one 
Gi, the maximal clique Q' containing Wj does not intersect at least one part P,. Including 
the part {wj}, Q' intersects at most (n — 1) + 1 = n parts. Hence n is a [n, 2]-partition. 

On the other hand, suppose there is a Yes solution to the CP instance. Since the 
clique intersection constraint is satisfied for Q, the vertices of Q are divided into parts of 
size exactly 2. These parts give the required groups of SP because, for a maximal clique 
Q' containing Wj has clique intersection at most n, and hence it must not intersect with 
at least one part Pi which implies that Sj contains both elements of Gi. □ 

Lemma 35. SAT ^ P SP. 

Proof. Suppose an instance of SAT has p Boolean variables x\, X2, ■ ■ ■ , x p and q clauses 
C\, C2, ■ ■ ■ , C q . Without loss of generality, we assume that there is at most one literal for 
each variable in each clause. Now we construct an instance of SP as follows. There are 4p 
elements x\, x[, T\, iq, X2, x' 2 , T 2 , F 2 , . . . , x p , x' pl T pi F p and the subsets are of two types 
as follows: (1) the subsets {xi, a;-, TJ, {xi, x' i7 Fi}, {xi, T i7 Fi}, {x^Ti, Fi} for all 1 < i ^ p, 
and (2) the subset uj^Cj {h, Ti} for all clause Cj, where k is either Xi or x\ (e.g., for 
Cj = (xi + x' 2 + x 3 ) the subset {x\, Ti, x' 2 , T 2 , x 3 , T 3 }). 

Suppose there is a satisfying assignment for the SAT instance. Then construct a 
grouping for the SP instance as follows. For all i, if Xi is true then construct two groups 
{xi, Ti} and {a^, Fi}; otherwise (i.e., Xi is false) construct two groups {xi, Fi} and {x' i7 Ti}. 
Clearly each subset of type (1) has two elements belonging to the same group. Since each 
clause is satisfied there must be a variable Xi such that one of Xi and x ■ is true. Hence the 
corresponding subset of type (2) must have two elements belonging to the same group. 

On the other hand, suppose there is a Yes solution for the SP instance. The subsets 

of type (1) force the elements Xi , x ■ , Ti , Fj to form 2 groups amongst themselves. The 
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subsets of type (2) ensure that one of the literals k in the clause Cj must group with T, and 
hence Cj must be true. This implies that the SAT instance has an satisfying assignment 
implied by the groups. For some Xi the grouping may contain {x i} x^}, {Tj, Fj}, in which 
case the value for the variable Xi can be chosen arbitrarily. □ 

11. Conclusions and Future Work 

We gave polynomial time algorithms for unweighted and splittable weighted versions 
of the component coloring problem for proper interval graphs and showed that it is NP- 
hard for split graphs. However the complexity of both the versions are not known for 
general interval graphs. We would like to get polynomial time algorithms for general 
interval graphs using similar ideas. This may lead to a constant factor approximation 
algorithm for the weighted version of the problem for general interval graphs which is 
known to be NP-hard, using ideas from Bin-packing. 
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